Very large population text-independent speaker identification using transformation enhanced multi-grained models

Authors

  • Upendra V. Chaudhari
  • Jirí Navrátil
  • Ganesh N. Ramaswamy
  • Stéphane H. Maes
Abstract

The paper presents results on speaker identification with a population of over 10,000 speakers. Speaker modeling is accomplished via our Transformation Enhanced Multi-Grained Models. We pursue two goals. The first is to study the performance of a number of different systems within the modeling framework of multi-grained models. The second is to analyze performance as a function of population size. We show that the most complex models within the framework perform best and demonstrate that, approximately, the identification error rate scales linearly with the log of the population size for the described system. Further, we develop a candidate rejection technique, based on our analysis of system performance, that indicates low confidence in the chosen identity.
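
The scaling claim in the abstract can be written compactly. The following is only a sketch of what "linear in the log of the population size" means; the symbols E(N), a, and b are notation assumed here for illustration and are not taken from the paper, which reports no constants in this abstract.

```latex
% E(N): identification error rate for a population of N speakers (assumed notation)
% a, b: constants that would be fitted to measured error rates (hypothetical)
\[
  E(N) \approx a + b \log N
\]
% Consequence of the log-linear form: doubling the population adds a fixed increment.
\[
  E(2N) - E(N) \approx b \log 2
\]
```

Under this form, each doubling of the population would raise the error rate by roughly the same amount, rather than by an amount proportional to the number of added speakers.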

Similar resources

Transformation enhanced multi-grained modeling for text-independent speaker recognition

We describe our formulation of transformation enhanced data modeling used to develop a multi-grained data analysis approach to text independent speaker recognition. The broad goal is to address difficulties caused by sparse training and test data. First, our development of maximum likelihood transformation based recognition with diagonally constrained Gaussian mixture models is detailed. We giv...

Multi-Grained Modeling with Pattern Specific Maximum

We present a transformation based, multi-grained data modeling technique in the context of text independent speaker recognition, aimed at mitigating difficulties caused by sparse training and test data. Both identification and verification are addressed, where we view the entire population as divided into the target population and its complement, which we refer to as the background population. F...

Robust text-independent speaker identification using Gaussian mixture speaker models

This paper introduces and motivates the use of Gaussian mixture models (GMM) for robust text-independent speaker identification. The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity. The focus of this work is on applications which require high identification rates using short utterance ...
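
As a rough illustration of GMM-based speaker identification in general, the sketch below scores an utterance against each speaker's diagonal-covariance GMM and picks the speaker with the highest average log-likelihood. It is a minimal sketch under stated assumptions, not the method of any paper listed here: the function names `gmm_avg_loglik` and `identify`, the model layout (weights, means, variances), and the use of plain NumPy are all choices made for this example, and model training is not shown.

```python
import numpy as np


def gmm_avg_loglik(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    frames: (T, D) feature vectors; weights: (K,); means, variances: (K, D).
    All names and shapes are assumptions made for this illustration.
    """
    # Difference between every frame and every component mean: (T, K, D)
    diff = frames[:, None, :] - means[None, :, :]
    # Log of the Gaussian normalizing constant for each component: (K,)
    log_norm = -0.5 * np.log(2.0 * np.pi * variances).sum(axis=1)
    # Log density of each frame under each component: (T, K)
    log_comp = log_norm[None, :] - 0.5 * (diff ** 2 / variances[None, :, :]).sum(axis=2)
    log_weighted = np.log(weights)[None, :] + log_comp
    # Numerically stable log-sum-exp over components
    peak = log_weighted.max(axis=1, keepdims=True)
    frame_ll = peak[:, 0] + np.log(np.exp(log_weighted - peak).sum(axis=1))
    return float(frame_ll.mean())


def identify(frames, speaker_models):
    """Return the best-scoring speaker name and the score of every speaker."""
    scores = {
        name: gmm_avg_loglik(frames, *params)
        for name, params in speaker_models.items()
    }
    best = max(scores, key=scores.get)
    return best, scores
```

Averaging over frames keeps scores comparable across utterances of different lengths. The gap between the best and second-best score is one simple quantity a system could inspect when deciding whether to trust a decision, although that is a generic idea and not a description of the rejection technique mentioned in the abstract above.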

Text Independent Speaker Identification Using Automatic Acoustic Segmentation

This paper describes an acoustic class dependent technique for text independent speaker identification on very short utterances. The technique is based on maximum likelihood estimation of a Gaussian mixture model representation of speaker identity. Gaussian mixtures are noted for their robustness as a parametric model and their ability to form smooth estimates of rather arbitrary underlying den...

Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution

In our previous work, we have proposed a speaker modeling technique using spectral and pitch features for text-independent speaker identification based on Multi-Space Probability Distribution Gaussian Mixture Models (MSD-GMMs). We have presented a maximum likelihood (ML) estimation procedure for the MSD-GMM parameters and demonstrated its high recognition performance. In this paper, we describe...

Publication date: 2001